Beyond linear transforms: efficient non-linear dynamic adaptation for noise robust speech recognition

نویسندگان

  • Steven J. Rennie
  • Pierre L. Dognin
چکیده

In this paper, we present new theory and results that combine constrained Maximum Likelihood Linear Regression (MLLR), known as feature space MLLR (fMLLR), a state-of-the-art model adaptation technique, with Dynamic Noise Adaptation (DNA), a state-of-the-art noise adaptation algorithm. We explain how DNA implements a highly non-linear transform on speech model features, and why DNA is better suited for compensating for additive noise than fMLLR. Tests results are presented on the DNA + Aurora II framework, which is based upon a collection of challenging in-car noise recordings, as a function of SNR. The results demonstrate that DNA significantly outperforms block fMLLR on additive noise, and that DNA + fMLLR outperforms the ETSI advanced front-end (AFE) system + fMLLR by a significant margin (over 7% absolute).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Linear transformations in sub-band groups for speech recognition

Linear transforms have been demonstrated to successfully achieve on-line speaker and environmental adaptation for robust recognition. This paper explores the gains in computational speed, speaker adaptation convergence rate and recognition performance obtained through the use of multi-resolution sub-band linear transforms in speech recognition. A useful feature of multiresolution processing is ...

متن کامل

An Efficient Block-based Dynamic Range Adjustment Method in Noise-robust Continuous Speech Recognition

This paper proposes a new technique for speech feature estimation under noise circumstances. This new approach yields noise-robust continuous speech recognition (CSR). Noiserobust techniques for isolated word speech recognition typically employ the running spectrum analysis (RSA), the running spectrum filtering (RSF) and the dynamic range adjustment (DRA) methods. Among them, only RSA has been ...

متن کامل

Performance improvement of exemplar-based noise robust ASR using speaker normalization methods

The Automatic Speech Recognition (ASR) technology now has been proved to have desirable performance on an environment that is exactly the same as their observed in training the recognition model. The HMM is the most popular and successful stochastic approach to speech recognition in general use, due to the existence of elegant and efficient algorithms for both training and recognition. However,...

متن کامل

Dynamic Robust Speech Recognition

Robust recognition theory has become one of research focuses of acoustic speech recognition. Acoustic speech digital signal is a random process repeatedly alternating stationary pieces with non-stationary pieces. However both the current linear and stationary characteristic parameters drawn from such signals and the rigid recognition models do not adapt to such repeatedly alternating property o...

متن کامل

Fast speech adaptation in linear spectral domain for additive and convolutional noise

In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is dif...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008